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ABSTRACT 


Sequence analysis of a substantial part of the 
polymerase gene of the murine coronavirus MHV-A59 
revealed the 3’ end of an open reading frame (ORF1a) 
overlapping with a large ORF (ORF1b; 2733 amino 
acids) which covers the 3’ half of the polymerase gene. 
The expression of ORF1b occurs by a ribosomal 
frameshifting mechanism since the ORF1a/ORF1b 
overlapping nucleotide sequence is capable of inducing 
ribosomal frameshifting /n vitro as well as in vivo. A 
stem-loop structure and a pseudoknot are predicted in 
the nucleotide sequence involved in ribosomal 
frameshifting. Comparison of the predicted amino acid 
sequence of MHV ORF 1b with the amino acid sequence 
deduced from the corresponding gene of the avian 
coronavirus IBV demonstrated that in contrast to the 
other viral genes this ORF is extremely conserved. 
Detailed analysis of the predicted amino acid sequence 
revealed sequence elements which are conserved in 
many DNA and RNA polymerases. 


INTRODUCTION 


The genome of mouse hepatitis virus (MHV), a coronavirus, 
consists of an infectious single stranded RNA molecule of 
approximately 30 kb in length (1). After entry the viral genome 
is released, translated into the RNA dependent RNA polymerase 
and subsequently used as the template for the transcription of 
negative stranded RNA of genome length (2, 3). This RNA then 
serves as a template for the synthesis of six subgenomic mRNAs 
and genomic RNA. The subgenomic mRNAs form a 
3’-coterminal nested set. An unusual feature of the mRNAs of 
coronaviruses is the presence of an identical leader sequence. 
This common leader is proposed to result from a unique leader- 
primed transcription mechanism. The transcription and translation 
strategy of coronaviruses has recently been reviewed in detail (4). 


EMBL accession no. X51939 


The RNA dependent RNA polymerase of coronaviruses is a 
multifunctional protein: it contains the activities necessary for 
the transcription of negative stranded RNA, leader RNA, 
subgenomic mRNAs and progeny virion RNA. In addition it is 
likely to possess capping activity. These activities and the 
protein(s) on which they reside are poorly characterized. 
Complementation studies using temperature sensitive (ts) mutants 
which are defective in RNA synthesis revealed six different 
complementation groups, indicating that a large number of genes 
or at least activities are involved in the synthesis of the viral RNAs 
(5, Wan der Zeijst, personal communication). Several authors 
have shown the presence of membrane associated RNA dependent 
RNA polymerase activity in lysolecithin permeabilized MHV- 
AS9 infected cells (6, 7) or cytoplasmic lysates of MHV infected 
cells (8, 9). Brayton et al. (10) have described one early and 
two late polymerase activities in lysates of MHV-A59 infected 
cells. The early polymerase activity was shown to be involved 
in the synthesis of negative stranded RNA, while the late RNA 
polymerase activities were responsible for the synthesis of 
genomic RNA and subgenomic mRNAs. 

In vitro translation of genomic RNA of MHV-AS9 resulted 
in the synthesis of a protein with a molecular weight exceeding 
200 kd (11). Jn vitro this protein is cleaved into a 220 kd and 
a 28 kd protein. The p28 protein is the N-terminal cleavage 
product of the precursor protein (11, 12) and has also been 
identified in MHV-AS59 infected cells (13). 

The nucleotide sequence of the gene encoding the RNA 
polymerase (pol) of the avian coronavirus infectious bronchitis 
virus (IBV) has been determined (14). The pol gene, which is 
about 20 kb in length, contained two large open reading frames 
(ORF) ORF la and ORF 1b (previously termed F1 and F2) which 
potentially encode polypeptides of 441 kd and 300 kd, 
respectively. The ORFs overlap by 42 nucleotides, ORF 1b being 
in a —1 reading frame with respect to ORFla. Brierley et al. 
(15, 16) showed that a cDNA fragment spanning the 
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ORF 1a/ORF Ib overlap was able to direct ribosomal frameshifting 
in vitro and in vivo. 

Recently, we have completed the molecular cloning of the 
genome of MHV-AS59 and determined the nucleotide sequence 
of the p28 coding region at its 5’ end (1). Here we present the 
predicted amino acid sequence of ORF 1b and of the carboxy] 
terminal region of ORFla of MHV-A59. In addition, we 
demonstrate that a conserved nucleotide sequence of the 
ORF 1a/ORF 1b overlap directs ribosomal frameshifting in vitro 
and in vivo. 


MATERIALS AND METHODS 
Isolation of viral genome RNA 


Virus obtained from roller bottle cultures of Sac(—) cells infected 
at 2 p.f.u./cell was purified on sucrose gradients as described 
before (17). Viral genomic RNA was isolated as described 
previously (18). 


cDNA synthesis and cloning 


First and second strand cDNA synthesis were carried out as 
described by Gubler and Hoffman (19) using viral genomic RNA 
(50 pg/ml) as a template and calf thymus pentanucleotides 
(100 pg/ml) as random primers. Reverse transcriptase was 
obtained from Promega, RNasin from Amersham, Escherichia 
coli DNA ligase from New England Biolabs, RNase H and DNA 
polymerase I from Boehringer. After phenol/chloroform 
extraction and ethanol precipitation approximately 0.3 pg cDNA 
was used for homopolymeric tailing using dCTP (20). Samples 
were taken every 30 seconds during tailing and the reaction was 
immediately stopped by adding 0.1 volume of 1 % SDS containing 
100 mM EDTA. After phenol extraction and ethanol 
precipitation, the tailed cDNA samples were annealed to Pst I 
digested, oligo-dG tailed pUC9 (Pharmacia). For transformations 
(21) Escherichia coli strain JM109 was used. 


Subcloning for M13 sequencing 


DNA restriction fragments were separated by agarose gel 
electrophoresis and isolated by binding to NA45 membranes 
(Schleicher and Schuell). Purified fragments were recloned in 
M13 vectors. When no convenient restriction enzyme sites were 
available plasmid DNA was digested with Rsa I and the DNA 
fragments were directly subcloned in Sma I cut M13mp19. M13 
clones were selected by hybridization to nick translated purified 
DNA fragments of the region to be sequenced and screened by 
single track sequencing. 


DNA sequencing 


Single stranded DNA from M13 clones was sequenced using the 
Klenow fragment of DNA polymerase I (22) and [??P]a-dATP 
or T7 DNA polymerase (23) (Sequenase, US Biochemicals) and 
[5S}a-dATP. Double stranded DNA was sequenced using T7 
DNA polymerase according to the instructions of the 
manufacturer. Sequence data were analyzed using the computer 
programs of Staden (24) and Wisconsin (version 5, 1987)(25). 
Comparison to the National Biomedical Research Foundation 
(NBRF) protein identification resource was made using the 
program FASTA (26). 


Construction of a plasmid for the analysis of ribosomal 
frameshifting 


To study the potential ribosomal frameshifting in the pol gene 
of MHV-AS9 the expression vector pBBMAC was constructed 


as follows. Plasmid pMAC (a gift of Peter Rottier), which 
contained a copy of the MHV-A59 membrane (M) protein gene 
of which the region encoding the amino acids 121 up to 196 (27) 
was deleted (full details on pMAC will be published elsewhere), 
was digested with BamH I[ and filled in using the Klenow 
fragment of DNA polymerase I. The M protein encoding DNA 
fragment was purified and ligated into Sma I cut Bluescribe(+) 
vector (Stratagene, USA). A clone containing the MAC coding 
region downstream of the T7 promoter was selected. The unique 
Sma I cleavage site of the resulting expression vector pBSMAC 
was converted into a Bgl II site by an 8-mer linker addition, 
resulting in pBBMAC. 

Clone P638, which contains the ORFla/ORF 1b overlap (Fig. 
1), was digested with Pst I. The 1.3 kb cDNA insert was purified, 
digested with Alu I and ligated into the filled-in and 
dephosphorylated EcoR I site of a Bluescribe plasmid. After 
transformation bacterial colonies containing the 160 bp Alu I 
fragment spanning the ORF1la/ORF 1b overlap were selected by 
colony hybridizations (28) using a nick-translated BamH I—Kpn 
I 850 bp cDNA fragment of clone P1136 as probe. Clones 
containing the expected 160 bp insertion were sequenced. The 
resulting plasmid pAPO was digested with EcoR I and made blunt 
ended with Klenow fragment of DNA polymerase. After ligation 
to 12-mer BamH I linkers and digestion with BamH I the DNA 
fragment was purified and ligated into Bgl If cut, 
dephosphorylated expression vector pBBM C. Plasmid DNA 
isolated from the resulting transformants was sequenced to 
determine the orientation and the borders of the polymerase 
ORF la/ORF 1b overlapping fragment of several clones. Clone 
pAP! was found to be correct and used for analysis of the 
potential ribosomal frameshifting. 


In vitro transcription and translation 


Plasmid DNA of pAP! was purified on a CsCl gradient (29) and 
linearized with Hind II. Jn vitro transcription and translation were 
performed as described (30). 


4567 


Figure 1. Cloning and sequencing strategy of the ORF 1b region of the MHV- 
A59 polymerase gene. A) The upper line represents the MHV genome. Vertical 
bars and the numbers above indicate the junction sequences involved in the initiation 
of the transcription of the corresponding mRNA. Open boxes represent the open 
reading frames in the polymerase gene. B) The open bar represents the sequenced 
region of the polymerase gene. The black triangle and the open triangle indicate 
the start of the large 3‘ORF (ORF1b) and the junction sequence for the initiation 
of mRNA2 transcription, respectively. The positions of oligonucleotides A and 
T12 are indicated. Negative numbers mark the distance to the start of the poly(A} 
tail of the genome. The numbered bars refer to the cDNA clones used for 
sequencing, the sequenced areas are indicated in black. 
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In vivo expression of pAP1 


Hela cells (210°) were infected with recombinant vaccinia 
virus VTF7—3, which contains the T7 RNA polymerase gene 
under the control of a vaccinia promoter (31) at a m.o.i. of 10 
p.f.u. At 90 min. post infection (p.i.) the cells were transfected 
with pAPI (5 yg) as described by Gorman (29). At 14 hr. p.i. 
the cells were labelled for 30 min. with 60 pCi/ml 
{°5S]-methionine (32). Cell lysates were prepared (33) and 
clarified for 90 min. at 12.000xg (4°C). 


Immunoprecipitations of proteins 


Immunoprecipitations of the [*°S]-methionine labelled proteins 
were performed as previously described (32); 4 pl of in vitro 
translation mixtures or 150 yl of the cell lysates were used. A 
monoclonal antibody (moab) J.1.3 (a gift of John Fleming and 
Stephen Stohlman; ref. 34) directed against the aminoterminal 
region of the M protein of MHV-A59 and an anti-peptide serum 
raised against the carboxyl-terminal 18 amino acids of the M 
protein (Rottier, manuscript in preparation) were used as MHV 
membrane protein specific antisera. 


RESULTS 
Molecular cloning of the polymerase gene 


Initially we used a synthetic oligonucleotide complementary to 
the conserved junction sequence immediately upstream of the 
nucleocapsid gene to prime the cDNA synthesis on purified 


vTML AL WH P LLeSRRPTT& QOS Y CCAS c c 
TOTGACCATECTGOCALT Gat: ATOOCEATTACTATT ‘ANC GEAGECCAALCALTAA) ICAGEATTETTATGETCEIGCTTCOGTITETATATATTEECE 


1 

1 

M SRVEHPOVOELCKLROKFVQVPtGCIKOPYSYVL 
100 CTCECE TON TEMCATCEAGATETTEATESATIGTOCAMIT ACGLIGGCAAGTTTGTCCAAGTCLOCTTAGGCATAMAAGATCCTGTGICATATGTGTT 
i) 

1 
99 


“THOS CUVCEFWRO CSCS Cv eT ES oFOS ko Tart 
L 
CACECATIATAS TTSTEMGSTI TETEEETTTTORERAGATRTAGLTETTCC 


P¥SwWKRHEL 


wz NGFEVQY ; , 
RR TRC TS VM ARLVPCASCLOTOVQLRAFOICHA 
25R MANCGGATTOGOGE TACANGTOTAMATGLOCGTCTIGTACCCTCTECCAGTEGCTTEGACACTGATGTTCAATTAAGGGCATTTGACATITGTAATGLT 


52 WR AGIG Y KVECCRFQORVOEOGHKLDKFFY VE 
37 AATCEAGETCRCATTCET TTGTATTATAAAGTAATTECTECESCT TOCROCETSTAGAT TASGACECAMCAMGTTGGATAAGTICTITGTICTI AAA 


OS RTNLCE VY RKREKECYELTRECCYVYAEWMEFFTF DY 
196 | ACTAATITMCAASTOTATAATANGCAGAAAGAATOCTATAGTTGALAAAACHATCEGTCTTET OSE TEAACACCAGTICTICAZATTICATGT 


9 €6C SRVPHIVREOLSKFTMLOL ALR UFORNDE 
CTTTCAMAGTTT TTTTGACCLAATCATTCT 


S05 GAGCEAARTCOGST ACTALACA TAGT CCST AAAGAT| ALTAISTIMCATCITIECTATGEATTOCETCA 
I} STL KE TLL TYAECEESYFQOKKOWYODF VER POT I 
4 TEAM CTAARAMATICTCCEEATATCTCATETOMMIETEC ALT T CCANANGANGEALT GOT A’ ATGATTTTETTGAGMTCCTGATATAATY 


mom ye kK Loe RALLMTAKEFADALYEAGLVGY, 

m3 AMTCTTATAWARGETICTCETAATTTATARACETCTTAACACTOCOGTTTOEAHCEAT ACTOCACCEAGECTTNGTAGETCTITTA 
27 71089 LYGQWYOFGDFYKTVPEGCCEVYAVADSYY 
832) ACACTTGAT, AATCAMCATTTATATOSTCAATEST ATGALTTIGGAGAT TTTGTCAMGACASTACCTGG TTGTGSTET TGCOGTOSCAGALTCTIATTAT 


750 WHPKRLTACHALOSELFYNGCTYREFOLVQYOF 
1] TEATATATCATGCCAATOCTEALTATSTETCATCEGTIGLATAGTOAGTISTITVTAN AATOGTACTTATAGGCAGTITGACCTICTICAGTATGATTTT 


a3 C«*ST DFKLELFIKYFRWWVSHTYWPHTCECEOORCII 


1090 ACTA TANT ACACTCTTCACAAGATTTIAMEACATATCACTACCE EMAC TSTOASTECEACHTOAAGSTECATATI 


36 WC AWE OR FSMVLPKICFREPLYRQIFYOGYPFEFY 
11BD CATTGODLCAATTTTAATATACTTTTTASTATGGTCTIACETAMCACCTETTTISOCEICTTCTTAGGCACATATITGTCEATOETOVTCETTVCGYT 


“a VSITGCYHYKELEVYH MH OVOTHRYRLESLKOLiLY 
1288 STGTCOATCSST TASCATTATGAATIAGTOTI GE TATEAATATEGATGIGGATACACATCENTAICOCTIGICTCTIANGACTIOCIDTIGIAT 


SZ AADOPALEVASASALLDLRICCESVAALTSCOVEKF 
1387 CCTCONGACOLTCOCLTTCATCTGGOETCTOCTACICEACTOCTTCATITGORCACATETIGTITTAGCCTTECAGETATTACAACTCECGTAMAATTT 


as QiverCuF OQOFYEFILS KELL KE CSSVYOUKEF 
L485 CAAMLASTIAAACCTCGAAATTTTAATCAGGATTTTTATGAGTTTATTTTCAGT MAGGCL TGLTT AAASASSIRASE TCCGTTGATTTGANGCACTTC 


“3S FFTODGHAAT TOYeyrneyNtPTAVOTKOLLFYE 
1585 TTCTTTAGGCASCATSTAATOCTELTATTACTOATIATAATTATIACAAGTATARTCTACECACCATGGTGEATANT AAGCAGTTSTTETTTETTTTA 


@l ev var Ter TYEGCGCCIPATQVYIEYVNNYOKSAGYP TF 
1664 COASTICITAATAASTATTTTEAGATCTATEAGECTECCTET AT ACOCGLAACACASSTCATTGTTAATAATTATGAT AMGAG TOL TG FATOCATTT 


$14 eR CK At yc ALS PEt Qo ei Ay Tie wv oe Tt 
1783 AATAAATTTGGAAASGCCASGL TCTATTATGASGLATTATCATT TGAGGAGLAGGATGAAATTIATGLGTATALCAAACGLAATETULTCCOGALIL TA 


kell TQHMLKEYALSAKNRARTYACYSTLSTHATGR AF O 
1882 ACTCAAATGAATCT TAMATATOLTATTASTOC TAAGAATASCSLOLGL ACG TT! CTATICTCASTALTATGALTOSCASAATET TTCAT 


mo OK CUEKSTAATREVPYYIGCTIKF YG GWDOALARRL 
1981 CAMASTCTCTAAASASTATASCAGC TACT COCGGTGT ICC TETAGTTATASGSCACLACCAAGTICTATOLCGGTTGGEA TGATATSTIACLOGOCTT 


613 D¥VOSPYLAGWOYRPKCORARPNITLRIVYSSLVE 
2080 AT AWGATTCAASICENTACATCTTGGMCTATETAM ETERECTCEATECEMACA  MCTCOGTATTGTTAGTASTTTGSTOCTA 


a ARK HOSCCS RLANECAQVYVLSEIYNCES 
2179 COCCETAMACATCATIOSTECTETTCECATACECA AEATICTATCRTCTTOCCAACCAGTECRCOT RAGTITTGASTCAAATICTTATETGTORTOCT 


es CY YVKPGCETSSCOATTIAFANSYFMICQAYSANY 
2278 TCTTATTATCTT AAACCASGTOSCALTAST AST GGA TOCAMCLACTCLITTTGCTAATTCTGTOFITAACATTIGICAASL IGT FICCECLAATGTA 
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genomic RNA. Virus specific clones were mapped by dot blot 
analysis with the purified individual MHV mRNAs (data not 
shown). Oligonucleotide A was synthesized from sequence data 
obtained from a cDNA clone which only hybridized to genomic 
RNA. Oligonucleotide T12 was synthesized on the basis of the 
nucleotide sequence of RNase T1 resistant oligonucleotide T12 
(based on the nomenclature of ref. 35; unpublished results), which 
is located in the pol gene of MHV-AS9 (36). Oligonucleotides 
A and T12 were used to screen the random primed genomic 
cDNA library. Several large cDNA clones (P095, PO96 and 
P098) hybridizing to both oligonucleotides were identified (Fig. 
1). In addition several cDNA clones were identified which were 
only positive with oligonucleotide A e.g. P030 and PO035. The 
cDNA clones P096 and PO30 contain the complete coding 
sequence of mRNA 2 (37) hence they map to the 3’ end of the 
polymerase gene. The cloning of the pol gene was completed 
by screening the cDNA library using a nick translated probe 
derived from the most 5' mapped cDNA clone on the viral 
genome. Clones P638, P737 and P1136 were isolated from a 
second random primed cDNA library prepared against genomic 
RNA from an independent stock of MHV-AS9 (1). 


Nucleotide sequence analysis 


The cDNA clones used to determine a substantial part of the 
nucleotide sequence of the pol gene of MHV-A59 are shown (Fig. 
1). Each nucleotide of the consensus sequence (Fig. 2) was 
determined on at least two independent cDNA clones. Analysis 


nz ¢ $ uGHKS EDLSITREL QKAL Y YyYR AO HK Y 
27 TecTeecrTaTecear Rear achcarebcarresra TACELGAGTT ACAAAMGCELCT A) ATACTCTAATETCTATCETSCRGACEATOYT 


m5 DPAFVYSEYYEF LEK FSWHITLSDOGVYCYASEF 
2476 GACCOCGCATTTGTTAGTGAGTATTATGAGTTITTAAA TANGCATITTACTATCATEATTTTCAGTEATEATCETETTGYCTETIAT ATAATT CAA TTT 


™m ASKEY I AMISAFQQYLYY 9 WVEFRMSEARCWYVET 
275 GICTCCANGGETTATATTECTAATATANETCLT TTCAACACETATTATATTATCAAAATAATGIGT TAT TET CTCAGGLIAAATCTTGOGT AGAAALA 


Ml OLTEKGPHEFCSQHINLVYVKNOGDEYYLPYPDPSR 
2674 GACATCGAAAMGGGACLRLATGAATTTTCTICTCAACATACAATCC TAG TCAAGATGGATGGTGATGAAGTCTACCTTCCATACCLTEATCL TTCCAGA 


Oo TL GAGCCFVYDOLLKTOSVLLITERFYSLAILDAYPL 
IMGMGOGTTTOGTAAGTCTICCAATTIGA 


277 ATCTINGEACAGEETCTTTTGTTEATEATTTATTAMMGACTEATAGOSTICTCTICAT. TECTIATOCTITA 
wr vyYw eure YQuYyFRY “Eri KKLYNDLGuQrios 
2672. GTATACCATEACAACCEAGAGTATCAMAATETGTTCUGESTATATTTACAATATAT MAGAAGCTTACAATEA TCTOSETAATCAGATECTCEACASE 


910 Le 
2971 TACAGTETTATTTTAAGTACTTGTGATGGTCAMAAGTT TACTGATGAGALCTTTTACANGMACATTATTTAMGAAGTCCAGTOCTGLMAGCGTTOST 


943 ACYV¥VCSSQTStLRCESCIRKPLLELCCKOCAYOW HS 
w70 CECTRESTISICISTAGT IE FCAMCATEATIACS! FETeSCAST IGEN LAT ACECANGELT TT OCT CTE TTGCAMATOCOLETATGATCAIGTIATGTC 


76 HKYVLSVYSPYVCHSPECOVNOVIKLEYLGOGAMS 
3109 _ALTCATCATAAATATGTCCTGAGTCYSTEACCATATCTGTGTAATTCALDGGGATETEATCYAMATCATGYTALLAMATIGTATTTAGETOSTATGTEA 


109 YY CEDHKPQYSFRLUVHNGNVFELYKOSCIGSPY 
3268 TATTATTGTGAGGACCAT AAACLACAGTATTCATTCAAAT TGS TGATGAATGSTATOSTI 11 Tela TIATATAAACAATCTTGTAC TORT ICOCOL TAL 


er LE OF MKITASCKVTEVOOYVLARECTERLKLFAA 
3307 ATAGAGEATTTTAATAMAATAGE TAGTTECAAATOGACAGAMGTCEA TEA TTATETSCTAGCTAATAATECALCEAACECC FTAAATTGTTTCRCGEA 
AMCAATETTAT! 


Ne Tene VAPPLOKnYVYF TOY WFTW OKT VUE CE YY FD 
3565. ATTCETAANGTGAGACCACCAC TTAATAMMAATTATGTIT TTACTGOCTACCATTTTACTAATAATEGTAMGACAGTT TT AGLTGAGTAIGTTTTTGAT 


5 ETQKATELEAFKQOCYASATIRETYSDRELILS WE 
3466. GAAACGLAGANGGOCACAGAMGAGGOCTTT: COGTCNGEAACGATCOGTGAGA TCGTCAGLGATCGGGAGTIAATTTTATCTTGGLAA 


ma KSECTM EV YYRATITYE LSV¥VGDYFILISHAVSS 
64 AACAGTOAGTICACTAATERICTGIATIA TCL EC EALAALOALT TA TAAGTTATCIGTAGETCATCYETICATTTTAACATEACACEENSYCTETALT 


14 LSAPTLEVPQENYTSITRFASVYYSV¥YPETFOR NY PW 
3783. TTANGTOLICCTACAT TASTACCGCAGGAGAATTATACTAGLATTOGTITTG£TAGTETTTATAGIGTGLE TGACACGTTTCAGAATAATCTOOCTAAT 


1207 YORTGERKRYCTYVQGPPCTGOKSHCAILGLAYYYCT 
3BE2__TATCAGLALATTGGAATGAACLILTATTOTALTGTACAGGGACIDLL TOG TALTCETAAGTCCCATCTAGLCAT Telelal T AGL TGTTTATTATIGTACA 


1240 ARVYYTAASMAAVYDALCEKAHKFLE OI # OD 
3961 CORLRCSTOSTCTATACCEL TOL TASLCATELT GLAST TGACELEL TG TOT GAAAASELACA T: 


1273 PAK VRVOCYOKFRVNOTIRKEYYFTTINALPELY 
4050 CLTGLAAAGGTCOGTCTASATIGTIATCATAAATTT ANGGTCAATCACACLALTCGCAAS TATETGTTTACTALAATAMATGCATT ACLTCAGT TOOTS 


c TR i v 
ABATTTTTAAATATTAATGACT GCACGCETATTCTT 


106 Tord LS 
Aig) ACTOACATTATTCICOTICATOAMGTTACTATECTTICAACTATCAGCTGTEICTTATTAMCAGICEIGYTALECE TAAGCATTATCYSTATATTCEA 


1399 sa eee a cu eee ae 
4258 CADCCTECOCASTTACCTGCACCACGTGTGLTACTEAATAAGGGAALTCTAGAACCTAGATATITT AATICCSTTACCAASCTAATETGTTGTTTOGGT 
2 POT RFLGCTCYROPKETVOTYSALYY ANKLE KAKO 
4357 CAGATATTTTCTTGGSCACCTETTATAGATGLLCT MSSAGATT GT @SATACOSTCICASOLTTESTTTATAA TAT MSL TEAASEL TMAAATEAT 


MOS #SSHCFKVY YK CQTTHESSSAVHANHQQIHLIS KE 
4458 AATASCTCCATGTOCTITAASSTTTATTAT ARGGGOCAGACT ACACATGASAST IC TASTOL TOTTI AATATOLASCAMATACATTTAATTASTAASTTT 


1438 LEAMPSWVSHUAVFISPY HS OMYVAKRYLE LOT QT 
4555S TUAAASECAAACE ELAS TTGRASTAMCRLORTATTFATIASTCL TAT AA TAS TCASAALTA TCT TOL TAASAGASTCT ThA TT ACAAALCLAGALA 


Ml VOSAQCSEYOFVIYSQTAETAHS YN VNRF UV AT 
4654 GTASACTCASLSLASETICTON TICTETCAA 


(TATGATITIGTTATTN \TGTAAATAGATTCAATGTTOCTATT 
164 TRAKRKEILCY s's mgt rest ee to ae 
MASAASEGT| 


4753 MCACGTOLT, aTacevaretcaTASTATATECMTI TT TGASTCTCTIAATI TT AC TACAL TACIT TGA] AASA TTAALAATCLA 
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19397 RLQCTIMtEFKOCSRSY¥E YH PAHKHAPSFLAYWYOOR 
4852 COAT TACAGTOTACTACAMATT TOIT TAAGGAT TCI AGLAGGAGL TATGT AGGATATCACTCAGCILATECACCATCLOTT IT TEGCASTIGATOALAAA 


1570 YKVECOLAVOCLHEVAOSAVIYSRUEIStAGCFKELOL 
4951 TATAAGGTAGGLOGTCATTTAGLOGT ITLL TTAATTTGLTGATICIGLTGICACTTATTCGCGGLTTATATCALTCAT CRSA TICANGL TTGACTTS 


1603 TLOGCYCKLEFITROEATRRYRAWVVEFDAEGARAT 
5050 ACCCTTGATGGTTATTGTAMGCTOTTTATAALTAGAGA TGAMGL TATCAMACTGT TAGAGCLT GGG T TCL TTUGAT GLAGAMGETOCOCATELGATA 


a 
5149 COTGATAGCCATTGSGACAAATTICOCATTACAAT TAGGL TTT TCRACTOGAATTGATTT TCT TGTCOAMGCTACTOGAATGT I TGLTGAGAGAGAT GGT 


1669 YTVEFRKAAARAPPCEQFKHLEPLNSREQKWOVYR 
5248 TATGTCTTTANAAAGGLAGCUGCACGAGL TCC TCL TOGCGAACAAT TT AAACACETTATCOCACTTATGT CANG AGEL AGAAA TEGGATETOSTTCGA 


m2 1RIVQRLESORLYDLADSV¥VYLYIWAASFELTCUR 
S347 ATTAGAATAGTACAMATGTTGTCAGACCACCT AGTOGA TTTGGCAGACAGTGTTGT ACTIGTOACETGGGL TGLLNGLTITGAGCTCACATETTTELGA 
17S YRFAKVGREVVYCSY¥YCTKRATCFNSRTITCYYCCOWRH 
SAAS TATTTOGCTAMAGTTGGAMGAGANGTTGTCTGTAGTOTCTGCACCAMGLGTGUGACATGTTTTAATTCTAGAACTORATACTA 


mae SYSCOYLYNPLIVOTQQWGCYTESLETS aN dDPICS 
S5A5 AGTTATTCCTCTGATTACCTGTACAACCCACTAATAGTTGACATTCAACAGTGOGCATATACAGGATCITTAACTMGCAATCATGATCCTATTTGLAGL 


ol YHK GAH AS SDAIMTACLAYMOCFCKESV¥YANWALE 
55a4 CTCCATANGOETECTCATCYTGEATEATCTEATECTATCATEACCCRSTEICTAGETEY TCATCAT TECTTTTETAAGTETETTAATTECAATTIAGA 


14 a 
5743 TACCCCATTATTTCAAATGAGGTCAGTGTTAA ‘AATET TT MGGGCTGOGATOLTATOCAATAGGTATCATCTG 


1987 Eee aye ee re DASPYVKSVEQE 
SBK2_ TETTATEACATTOUCAMCECTASAGETCTTEOCTETGTCAMAGEATATEATT TT AAGTITTATEATCOLTECLCTCTTGTTANGTEIGTTAMACAGTTTI 


wo vYKYEANKDOQFLOGLONFWACHYDKYPANAYWYVC 
5941 GTTTATAAATACEAGGCACATAAAGA TCAATTTTTAGATGGTITGTGTATGT ITTGGAACTGCAATETGGAT AMGTATCLAGLGAATECAGTT GTGTET 


WB RF OTRYVLAKUEUNUCPCCMECCSLYVHKHAFAHYTS PFT 
OAD NGGTTTCACACGCGTCTCTTGAACAAATTMMATCTCCCTGSCTETAATGSTGGCAGTITGTATGTI AACAMACATCCATICCACACCAGTCOCTTTACE 


a a a aS 
6139 COGGCTCCCTTOGAGAATTIGAMGLCTATGLCTITCTIITATTATICAGATACGCCCTGTGTCTATATGGANGECATGGAATCTAMGCAGGTCGATTAT 


WO VPLRESATCEIRONLGOCAVCLKHAEEYREYLESY 
6238 GTCCCATTEAGANGCGL TACATCLATCACAMGATCCAA TT T NGG TGGCLCTGTT TET TAMAACATELTGAGGAGTATCETGAGTACCTTGAGTCTTAC 


we ATATTIAGFR TE WVYYKTFOFRYNMLWATFTIRLQSLER 
6337 AATACGBLAACCACAGL GGG TT TTACTITT TOGGTCTATAAGACTITTGATT ITIATAACCTTTGGAATACTTT TACT AGL TCCAMAST TT AGAAAAT 


mes YVY HC VM ACHFDGERAGCELPCAYIGEKYIAKIQA 
6436 GTAGTGTATAATTTGSTCAATGCIGGACACTTTGATCGCUSGGCGGGTGAACTCLCTIGTGCIGTTATAGGTGAGAAAGTCATTCOCAMGATTCAAMAT 


me EDV VYVFK MNT P PPT MV AVELFARRSIRPHPELK 
6535 GABGATGTCOTGSTCTTTAAAAATAACACGLEATICLOCACTAATETGOLTGTCGAATTATTYGCT ANGCECAGTATT CORLL CCACCECGAGLTT AN 


us LRFRNULNTOVOCWMSHVLYDYAKOS¥FCSSTYKVCK 
S434 CTCTTTAGAAATTTCAATATTOACSTGTECTECALTCACGTOCTTTSGKATTATOCTAAGUATACICIGTITTGCACTTCGACCTATANGETCTECAAA 


2164 YTIOLQCITESUCNVLEFOERONGCALEAFKKCRAGYY 
6733 TACACNGATTTACAGTGLATTGAAAGCTTGAATGTACTTTTTGATCRTICGTGATAATGSTGCICTTCAMGL TT TTANGAMGTOCUGLAATOSLGTCIAL 


2197 THTTK TR SLSMERGPQRADLNCYVVYEKYG DSOYV 
6832 ATTAACACRACAAAMATTAAAAGTCTGTCGATGATT AMAGSCLCALAALGTGLUGA TT TGAATERLGT AGT TG TGGNaAAMGT TGGAGATICTGATCTS 


MO EFWEAVR KDE DDVIFSRTGCSLEPSHYRSPQGHP 
6531 GAATTTTGGTITGCTGTOCETAANGACGLTCACCATGTTATCTICAGCCGT ACAGGGASCL TTGAACOGAGLCATT ACCGEAGCOCACANGGTAATECE 


MI CEONRVGCDOLSCHEALARGTIFTQOSRLLSSFTPARS 
TAS0 GGTGETAATCECKTERTEATCTCAGCGETAMTCAMGETCTAGCECGTGSCACTATCITTACTCAMMGCAGATTATTATECTCTTTEACACCTCEATCA 


‘CME K DF moO DD oD I Ae State tester rclactcsttrntrciasnt 
GAACACETT 


ChAT AOA MGATIATCGATTAGATCATATCTTTCATTECAATA TAGTTTACAGGACTACECSTTT GTTTATGETAGTITT 
WOKTICG LiL reetaRreoqgnrSutvilQgerviyvos 
TTA Trac ach Yc tA ASCASAMAATEAMCTCCTAATTRMGATTCSTXGAT TACCACTCT 


SUMS YFETOEASCSSKHSVCTV¥IDL LooFVoLyY 
AOCATTEATTESTCTTI AT CACTGACGAGAACAGTGTAGTAGTAMAGTOTCTCCACTCTTA! ITOATTTATTOTTMGATEATTTTCTECACATTCTA 


x SLMLKCWSKVVAVEVOFKOF QFRHLVCHREER YA 
AAGTCLCTEAATCTAMAGTCTGTCAGTAMGTICTTAATOTTAATETTEATTTT AAMGATTTCCASTTTATGT TEES TOLAATGAGGAGAMES CATE 
adrtictar ATOCETGTCTIATATAAGTATTTOONA 


LWayokKPITUPTGCOCRAAVAKYTQLOQYLS TIT. 
TIACTTTACET TTTGAGLACTACAACATTA 


Pe CTGGAATTATGGCAAGCOGATT |ACAGGATGTATGATGAATGTT GCTAMGTATACTCAAT TATGICAATA 


AVPAMHRYUNLCAGCSOKGYVAPESAVLROWLPAG 
CCAR TTCORSCT GRE TCTTOCOCCTOSSTCTGCASTICTT AGGCAGTOGL TACLAGLGGGA 
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TH WARY LF WRASTAWAGCGCAYSLFDASKFPLEAAGT 
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Figure 2. Nucleotide sequence of the ORF 1b region of the MHV-A59 polymerase 
gene and the predicted amino acid sequence of the major open reading frames. 
The most 5‘ AUG codon of ORF 1b and the junction sequence between the pol 
gene and the mRNA 2 coding region are underlined. 


of the sequence data revealed the 3’ end of an ORF (ORF Ia) 
partially overlapping with a large ORF (ORF 1b) which covers 
the 3’ half of the pol gene (Fig. 1). ORF!b has a length of 8199 
nucleotides and potentially encodes a protein of 309 kd (2733 
amino acids; Fig. 2). The first potential translation initiation 
codon is located at position 643 and the ORF terminates at 
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Figure 3. Comparison of the primary and secondary structure of the MHV-AS9 
and [BV-M42 ORF]la/ORF1b overlap A) Alignment of the nucleotide sequences. 
Numbers refer to the position of the 5’ G residue in the published sequence (MHV- 
ASS, this article; IBV, ref. 14). B) Predicted secondary and tertiary RNA structure 
at the site of ribosomal frameshifting. The potential codons involved in 
frameshifting are underlined, and the termination codons of ORF 1a are boxed. 
Solid lines illustrate the potential pseudoknot. Differences in the nucleotides 
involved in the predicted RNA structure of MHV in comparison to IBV are 
indicated in the predicted structure for IBV to illustrate the observed covariation. 


position 8443, which is just upstream of the conserved junction 
sequence AAAUCUAUAC. This region separates the gene 
encoding the 30 kd nonstructural protein from the pol gene (37). 
ORF ta terminates at position 318 and overlaps ORF 1b for 75 
nucleotides. 


Analysis of the ORFla/ORF1b overlapping sequence 


It has been shown that the nucleotide sequence of the IBV 
ORFla/ORFlb overlapping region is capable of inducing 
ribosomal frameshifting in vitro and in vivo (15, 16). A stable 
stem-loop structure in this overlapping region was predicted to 
be involved in this translational frameshifting. Comparison of 
the nucleotide sequence of the ORFla/ORF 1b overlap of MHV- 
A5S9 to the ORFla/ORF1b overlapping region of IBV-M42 
revealed a well conserved stretch of nucleotides (Fig. 3A). Fig. 
3B shows that a nearly identical stem-loop structure can be 
predicted for the MHV ORFla/ORF Ib overlap region. The 
insertion (MHV-AS9) or deletion (IBV-M42) resulting in a gap 
of three nucleotides in the alignment (Fig. 3A) is located in the 
bulge of the stem-loop structure (Fig. 3B). Even if larger regions 
of the sequence (up to 500 nucleotides) were analyzed for the 
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Figure 4. Analysis of ribosomal frameshifting in vitro and in vivo. A) Diagram showing the expression plasmid pAP 1 and the predicted sizes of protein products 
that would be expected from translation of the transcribed RNA. The black area represents the T7 promoter, regions encoded by the mutant M protein gene are 
hatched. Stop indicates the positions of the translation termination codons. B) /n vitro translation products were analyzed directly (lane 1) or after immunoprecipitation 
using moab J.1.3 (lane 2) or the carboxyl-terminal specific anti-peptide serum (lane 4), respectively. Lane 3 and 5; immunoprecipitations using the corresponding 
pre-immune sera. C) Lysates from pAP! transfected and vaccinia virus vTF7-3 infected cells were immunoprecipitated using moab J.1.3 (lane 1), the anti-peptide 
serum (lane 3) and the pre-immune sera (lane 2 and 4). M indicates molecular weight markers. 


presence of secondary structure the depicted hairpin structure still 
folds as a separate entity. 

In both IBV and MHV the translation termination codon of 
ORF 1a is located in this stem-loop structure. Apart from this 
conservation in secondary structure the potential tertiary structure 
is also well conserved. The nucleotide sequence of the loop gives 
rise to potential pseudoknot formation with sequences downstream 
of the proposed stem-loop structure. The significance of these 
proposed secondary and tertiary RNA structures is emphasized 
by the presence of covariation. Mutations in one part of the stem 
or in the nucleotide sequence of the loop are compensated by 
mutations in either the stem or in the downstream sequence 
involved in the potential pseudoknot, respectively (Fig. 3B). 


Ribosomal frameshifting in vitro and in vivo 


To prove that the ORF1a/ORF lb overlapping region of the MHV 
polymerase gene directs ribosomal frameshifting we cloned this 
region in a mutant M protein gene of MHV-A59 under the control 
of a T7 promoter. Termination of translation of pAP1 transcripts 
at the ORFla UAA stopcodon will result in the synthesis of a 


19 kd protein. However, when a —1 translational frameshift 
occurs a protein of 25 kd will be synthesized (Fig. 4A). Direct 
analysis of in vitro translation products revealed both proteins 
(Fig. 4B, lane 1). As expected moab J.1.3, directed against the 
N-terminus of the M protein, immunoprecipitated both the 19 
kd and 25 kd products, indicating that both proteins have a 
common N-terminus (Fig. 4B, lane 2). Only the 25 kd protein 
was specifically immunoprecipitated by the C-terminal anti- 
peptide serum (Fig 4B, lane 4). None of the translation products 
were immunoprecipitated by the pre-immune sera (Fig 4B, lane 
3 and 5). Since the methionine residues are only encoded by the 
region upstream of the ORFla/ORF1b overlap the frameshift 
efficiency can be easily estimated. The bands corresponding to 
the 19 kd and 25 kd products were excised from lane 1 (Fig. 
4B) of the dried gel and from the amount of radioactivity an 
efficiency of approximately 40% was calculated. 

To test whether the frameshift signal in the ORFla/ORF1b 
overlap was functional in vivo, Hela cells were infected with the 
vaccinia virus recombinant vTF7-3, expressing the T7 RNA 
polymerase and subsequently transfected with pAP1. Cells were 
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Figure 5. Proportional dot matrix comparison of the amino acid sequences from 
ORF 1b of MHV-AS59 and of IBV-M42. Numbers of the amino acid residues are 
indicated at the axis. For creating the dot-matrix plot the program ‘Compare’ 
of the Genetics Computer Group (Wisconsin) (25) was used with a window size 
of 21 and a stringency of 15. 


labelled with [35S]-methionine and cell lysates were 
immunoprecipitated using moab J.1.3 and the anti-peptide serum. 
Moab J.1.3 specifically immunoprecipitated the expected 
polypeptides of 19 kd and 25 kd from pAPI transfected and 
VTF7-3 infected cells (Fig. 4C, lane 1). These polypeptides were 


not present in lysates from cells that had only been infected 


with vTF7-3 (data not shown). The 25 kd protein was also 
immunoprecipitated by the anti-peptide antiserum directed against 
the carboxy]-terminus of the M protein. (Fig. 4C, lane 3). None 


of these proteins were precipitated by the pre-immune sera (Fig. 
4C, lanes 2 and 4). From these data it was concluded that the 
MHV-A59 ORF 1a/ORF 1b overlapping region was capable to 
induce ribosomal frameshifting both in vitro and in vivo. 


Computer analysis of the coronavirus polymerase genes 


Comparison of the predicted amino acid sequence of the products 
encoded by ORF 1b of MHV-AS59 and IBV-M42 revealed two 
large regions of high similarity (Fig. 5). The positional identity 
in an alignment of the amino acid sequence of ORF1b of MHV 
and IBV is 56%. 

Several short sequence motifs have been identified in particular 
polymerase proteins of RNA and DNA vinuses. One motif which 
contains the core sequence ‘GDD’ (38) has also been identified 
in the amino acid sequence of ORF 1b of IBV (39). This domain 
is conserved at an almost identical position in the product encoded 
by ORF 1b of MHV-AS59 (Fig. 6). However, in contrast to the 
“GDD’ amino acid sequence, both MHV and IBV contain the 
amino acid sequence ‘SDD’ at this position. Although 
occasionally a M, C, V or L residue has been reported in the 
position of the G residue (38), no serine residue has been reported 
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Figure 6. A) Localization of the ‘polymerase’ (B) and ‘helicase’ (C) domain in 
the second ORFs of the IBV and MHV-AS59 pol genes. B and C) Amino acid 
sequence of the conserved domains identified in ORF1b of MHV-AS59. Conserved 
residues are indicated by triangles. Numbers on the left refer to the localization 
of the conserved domains in the ORF1b sequence. Fig 6B; The ‘GDD’ or 
‘polymerase’ motif (38, 39). The position of the serine residue which replaces 
the glycine is indicated with an open triangle. Fig. 6C. The ‘GKS/T’ or ‘helicase’ 
motif (40, 41). 


immediately upstream of the two conserved aspartic acid residues. 
Analyzing the ORF1b amino acid sequence of MHV-A59 
revealed the presence of another ‘GDD’ motif at position 
2268—2270. Although it cannot be excluded that the ORF1Ib 
encodes more than one polymerase activity, it is unlikely that 
this ‘GDD’ motif is part of the active site of a coronavirus 
polymerase since the surrounding sequences do not meet the 
criteria proposed by Argos (38). Furthermore this ‘GDD’ 
sequence is not conserved between MHV and IBV. 

The amino acid residues encoded by ORF1b of IBV which 
exhibit similarity to a sequence motif found in a group of proteins 
from different organisms and which are probably involved in 
crucial nucleoside triphosphate dependent steps in nucleic acid 
replication (40, 41) are also present at a nearly identical position 
in the amino acid sequence encoded by ORFib of MHV-A59 
(Fig. 6). 

No other significant similarities were identified when 
overlapping regions of approximately 300 residues of the MHV- 
AS9 ORF 1b amino acid sequence were tested using the program 
FASTA (24) and the NBRF/PIR and NBRF/NEW protein 
identification resources (releases 19.0 and 37.0, respectively). 


DISCUSSION 


In this paper the primary structure of the second ORF of the 
putative MHV-A59 pol gene is described. Assuming that the 
organization of the polymerase gene of MHV is identical to the 
equivalent gene of IBV, in which two large ORFs have been 
identified (14), the nucleotide sequence of the smal! 5’ ORF 
presented in this article represents the 3’ end of a large ORF. 
This ORF starts at position 210 at the 5’ end of the viral genome 
(Fig. 1; ref. 1). 

No similarity has been detected in the predicted amino acid 
sequence of the 5’ end of ORFla of the IBV and MHV-JHM 
or MHV-AS9 polymerase gene (1, 12). However, the putative 
carboxyl terminal region of the translation product of ORF la and 
almost the complete translation product of ORFlb are well 
conserved among MHV-A59 and IBV. Recently, we have 
determined a small part (0.3 kb) of the nucleotide sequence of 
the 3’ end of the polymerase gene of the feline coronavirus FIPV. 
(De Groot, unpublished results). The deduced amino acid 
sequence of this region was very similar to the carboxyl terminal 
part of ORF 1b of MHV and IBV.In contrast to the high similarity 
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in the second ORF of the pol gene, no significant similarity has 
been observed in the amino acid sequence of the other viral 
nonstructural proteins. The structural proteins of coronaviruses 
only show significant similarity in relatively small regions. The 
overall identity in the nucleocapsid, membrane and spike protein 
is 29%, 30% and 35% respectively (reviewed by 4). This strongly 
suggests selective pressure against mutations in the pol gene to 
conserve functional domains. Identical observations have been 
made for the picornaviruses (42), alphaviruses (43), flaviviruses 
(44) and negative stranded RNA viruses like rhabdoviruses (45). 

The ‘S/GDD’ as well as the nucleotide triphosphate binding 
‘GKS/T’ amino acid sequence motif, which are encoded in the 
pol genes of coronaviruses, are also well conserved in the 
polymerases of viruses belonging to the picornavirus-like and 
alphavirus-like superfamily of (+) stranded RNA viruses (46, 
47). However, because of the quasi-helical nucleocapsid 
morphology and the expression of the viral genes by multiple 
subgenomic mRNAs, coronaviruses could not be assigned to 
either superfamily of RNA viruses (47). 

During the replication of coronaviruses multiple subgenomic 
RNAs are synthesized to position internal ORFs on the genome 
at the 5’ end of an MRNA. No subgenomic mRNA containing 
ORF 1b of the MHV polymerase gene at its 5'-proximal end has 
been detected in infected cells. Using an expression vector which 
contained the ORF1a/ORF 1b overlap of MHV inserted in frame 
within an MHV-AS59 mutant M protein gene construct, we were 
able to demonstrate that the ORFla/ORF 1b spanning sequence 
was capable of directing ribosomal frameshifting in vitro and in 
vivo. Brierley et al. (15, 16) have shown that the ORF 1la/ORF1b 
overlap region of IBV is also capable of inducing frameshifting 
in vitro and in vivo. Comparison of the nucleotide sequences of 
the overlapping region of IBV and MHV revealed that the signals 
used for ribosomal frameshifting in coronaviruses are well 
conserved. In both MHV and IBV a stable stem-loop structure 
can be formed downstream of the conserved sequence 
UUUAAAC. This sequence functions probably as the actual site 
for ribosomal frameshifting in MHV since mutations in this 
sequence have been shown to influence the ribosomal 
frameshifting in the IBV ORF1a/ORF tb overlapping region (16). 
It has also been shown to function as a site for ribosomal 
frameshifting in Rous sarcoma virus (48). The observed 
covariation between [BV and MHV in the predicted stem-loop 
structure and the pseudoknot underlines the importance of these 
Structures in translational frameshifting. Pseudoknots are also 
predicted to be involved in the ribosomal frameshifting for the 
expression of the polymerase gene of many retroviruses and the 
luteoviruses (16, Ten Dam and Pleij pers. communication). 
Recently, it has been shown for IBV that the proposed pseudoknot 
in the ORFla/ORF1b overlapping region is essential for 
ribosomal frameshifting (16). 

Frameshifting is more efficient in the coronaviral system 
(35-40% frameshifting) than in the retroviral system where only 
5—10% frameshifting has been observed (49). Ribosomal 
frameshifting is an elegant mechanism for regulating the synthesis 
of several proteins in a well balanced manner. In many 
retroviruses the polymerase is produced after translational 
frameshifting which results in the expression of a gag-pol fusion 
protein (48). Based on sequence comparison, it is postulated in 
this article that the polymerase function of MHV-AS9 is encoded 
downstream of the ribosomal frameshifting sequence and 
expressed as a fusion protein. It is tempting to speculate that this 
fusion protein is the actual polymerase. Cleavage of this functional 
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polyprotein could result in an inactive pol protein. Such a 
mechanism would explain the observed requirements for 
continuous de novo protein synthesis during the replication of 
MHV-AS9 (3, 6). 

The determination of the nucleotide sequence and the predicted 
amino acid sequence of MHV-AS9 ORF Ib will provide a basis 
for obtaining monospecific antisera against protein(s) encoded 
by ORFlb. These sera will be important for further 
characterization of the proteins involved in the discontinuous 
transcription of coronaviruses. 
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